A new measure of predictive ability for survival models
نویسندگان
چکیده
In clinical research, an understanding of prognostic factors is important in the design and analysis of clinical trials and retrospective reviews of clinical experience. The results of prognostic factor studies are usually summarized in the form of statistics resulting from statistical significance testing, i.e. estimated parameters, confidence intervals, and p-values. These statistics do not inform us whether prognostic factor information will lead to substantial improvement in the prognostic assessment. Predictive ability measures can be used for this purpose since they provide important information about the practical significance of prognostic factors. R-type indexes are the most familiar forms of such measures in survival models, but they all have limitations and none is widely used. Bura and Gastwirth (2001) [1] proposed a new predictive ability measure, named total gain (TG), for a logistic regression model. TG is based on the binary regression quantile plot, otherwise known as the predictiveness curve, which was first proposed by Copas (1999) [2]. Gu and Pepe (2009) [3] showed that TG is related to the ROC summary index, but it does not have the reported shortcomings of the ROC index. In this paper, we extend the proposed TG measure to survival models and explore its properties using simulations and real data. In survival models, the TG statistic is a non-negative, unitless measure of the total cumulative distance between the average survival probability, as expressed by the Kaplan-Meier (KM) estimates of the survival probability, at a fixed time point and the estimated survival probabilities from a given model. Standardised TG ranges from 0 (no explanatory power) to 1 (‘perfect’ explanatory power). In our simulation studies, we investigated the impact of censoring, covariate distribution and influential observations on the measure. The results of our simulations show that unlike most of the other R-type predictive ability measures, TG is independent of censoring and follow-up time. TG also increases as the effect of a covariate increases, but it is adversely affected by the categorisation of continuous prognostic factors. Finally, we applied TG to quantify the predictive ability of prognostic models developed in several disease areas. On balance, although TG lacks the intuitive interpretation of the explained variation measures, our results indicate that the estimates of the measure are within the reasonable range of the estimates of explained variation measures and can be recommended as an alternative measure to quantify the predictive ability in survival models.
منابع مشابه
Credit Risk Predictive Ability of G-ZPP Model Versus V-ZPP Model
Credit risk management is becoming more and more important in recent years. When a company deals with a financial problem, it may not be able to fulfill its financial obligations, which can cause direct and indirect financial losses to shareholders, creditors, investors and other people in the community. Advanced credit risk models that are based on market value include improving credit quality...
متن کاملExtracting Predictor Variables to Construct Breast Cancer Survivability Model with Class Imbalance Problem
Application of data mining methods as a decision support system has a great benefit to predict survival of new patients. It also has a great potential for health researchers to investigate the relationship between risk factors and cancer survival. But due to the imbalanced nature of datasets associated with breast cancer survival, the accuracy of survival prognosis models is a challenging issue...
متن کاملPredicting Bankruptcy of Companies using Data Mining Models and Comparing the Results with Z Altman Model
One of the issues helping make investment decisions is appropriate tools and models to evaluate financial situation 0f the organization. By means of these tools, investors can analyze financial situation of the organization and identify financial distress or an ideal condition, they become aware of making decisions to invest in appropriate conditions. The main objective of this study is to ev...
متن کاملPredictive Ability of Statistical Genomic Prediction Methods When Underlying Genetic Architecture of Trait Is Purely Additive
A simulation study was conducted to address the issue of how purely additive (simple) genetic architecture might impact on the efficacy of parametric and non-parametric genomic prediction methods. For this purpose, we simulated a trait with narrow sense heritability h2= 0.3, with only additive genetic effects for 300 loci in order to compare the predictive ability of 14 more practically used ge...
متن کاملThe extension of total gain (TG) statistic in survival models: properties and applications
BACKGROUND The results of multivariable regression models are usually summarized in the form of parameter estimates for the covariates, goodness-of-fit statistics, and the relevant p-values. These statistics do not inform us about whether covariate information will lead to any substantial improvement in prediction. Predictive ability measures can be used for this purpose since they provide impo...
متن کامل